DWMiner: A Tool for Mining Frequent Item Sets Efficiently in Data Warehouses
نویسندگان
چکیده
This work presents DWMiner, an association rules efficient mining tool to process data directly over a relational DBMS data warehouse. DWMiner executes the Apriori algorithm as SQL queries in parallel, using a database PC Cluster middleware developed for SQL query optimization in OLAP applications. DWMiner combines intraand inter-query parallelism in order to reduce the total time needed to find frequent item sets directly from a data warehouse. DWMiner was tested using the BMS-Web-View1 database from KDD-Cup 2000 and obtained linear and super-linear speedups.
منابع مشابه
Using a Data Mining Tool and FP-Growth Algorithm Application for Extraction of the Rules in two Different Dataset (TECHNICAL NOTE)
In this paper, we want to improve association rules in order to be used in recommenders. Recommender systems present a method to create the personalized offers. One of the most important types of recommender systems is the collaborative filtering that deals with data mining in user information and offering them the appropriate item. Among the data mining methods, finding frequent item sets and ...
متن کاملImproved Maximal Length Frequent Item Set Mining
Association rule mining is one of the most important technique in data mining. Which wide range of applications It aims it searching for intersecting relationships among items in large data sets and discovers association rules. The important of association rule mining is increasing with the demand of finding frequent patterns from large data sources. The exploitation of frequent item set has be...
متن کاملInfrequent Weighted Item Set Mining Using Frequent Pattern Growth
Frequent item set mining is one of the popular data mining techniques and it can be used in many data mining fields for finding highly correlated item sets. Infrequent item set mining finds rarely occurring item sets in the database. Most of the Existing Infrequent item set mining techniques finds infrequent weighted item sets with high computing time and are less scalable when the database siz...
متن کاملGenerating Similar Item Sets Of Temporal Databases Using Spamine Algorithm
Data mining is the process of extracting interesting like non-trivial, implicit, previously unknown and potentially useful information or patterns from large information repositories such as: relational database, data warehouses, XML repository, etc. Data mining is known as one of the core processes of Knowledge Discovery in Database (KDD). Association rule mining is a popular and well research...
متن کاملEfficient Utility Based Infrequent Weighted Item-Set Mining
Association Rule Mining (ARM) is one of the most popular data mining techniques. Most of the past work is based on frequent item-set. In current years, the concentration of researchers has been focused on infrequent item-set mining. The infrequent item-set mining problem is discovering item-sets whose frequency of the data is less than or equal to maximum threshold. This paper addresses the min...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2006